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Abstract 


Many students at-risk for or identified with reading disabilities need intensive reading 
interventions. This meta-analysis provides an update to the Wanzek and Vaughn (2007) 
synthesis on intensive early reading interventions. Effects from 25 reading intervention studies 
are analyzed to examine the overall effect of intensive early reading interventions as well as 
relationships between intervention and student characteristics related to outcomes. The weighted 
mean effect size estimate (ES = 0.39), with a mean effect size adjusted for publication bias (ES = 
0.28), both significantly different from zero, suggested intensive, early reading interventions 
resulted in positive outcomes for early struggling readers in kindergarten through third grades. 
There was no statistically significant or meaningful heterogeneity in the study-wise effect sizes. 
Exploratory examination of time in intervention, instructional group size, initial reading 


achievement, and date of publication are provided. 
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Current Evidence on the Effects of Intensive Early Reading Interventions 

While intensive interventions can be implemented in any academic domain, reading is a 
primary target for intensive interventions in the primary grades. The primary grades are 
particularly important because many reading difficulties and disabilities can be prevented if 
students are provided with early reading intervention (Fuchs, Compton, Fuchs, Bryant, & Davis, 
2008; Partanen & Siegel, 2014; Simmons et al., 2008; Torgesen et al., 2001; Vellutino, Scanlon, 
Small, & Fanuele, 2006). One way to identify students in need of intensive intervention is to 
examine reading achievement prior to intervention. Research has noted that very low levels of 
initial reading achievement predict later low levels of reading achievement even when these 
students are provided less intensive Tier 2 type interventions (Al Otaiba & Fuchs, 2002; Lam & 
McMaster, 2014; Nelson, Benner, & Gonzalez, 2003; Vaughn & Linan-Thompson, 2003). 
Deficits in phonological awareness, rapid naming, fluency, and the alphabetic principle appear to 
be the most consistent predictors of initial response to intervention (Al Otaiba & Fuchs, 2002; 
Lam & McMaster, 2014; Nelson et al., 2003). 

Furthermore, syntheses of the impact of reading interventions provided in the primary 
grades report higher average impacts on reading outcomes than interventions implemented in the 
upper elementary and secondary grades (Scammacca, Roberts, Vaughn, & Stuebing, 2015; 
Wanzek & Vaughn, 2007; Wanzek et al., 2013; Wanzek, Wexler, Vaughn, & Ciullo, 2010). If 
students do not develop strong reading skills in the primary grades, they will most likely 
continue to have difficulty reading throughout school (Francis, Shaywitz, Stuebing, Shaywitz, & 
Fletcher, 1996; Juel, 1988; McNamara, Scissons, & Gutknectch, 2011; Stanovich, 1986) and will 


be at an increased risk for dropping out of school (Hernandez, 2011). 


EFFECTS OF INTENSIVE EARLY READING INTERVENTIONS 5 


Over the past 10 years, there has been an increased emphasis on providing intensive 
reading interventions. In 2011, the Office of Special Education Programs funded the National 
Center on Intensive Intervention (www. intensiveintervention.org), which provides technical 
assistance to districts to support their use of intensive interventions. Their work includes the use 
of technical review committees to evaluate intervention programs, assessment tools, and 
implementation strategies, and to evaluate the use of intensive interventions in current schools. 
Furthermore, the Division for Learning Disabilities (DLD) published a position statement 
recommending that students at-risk for and with learning disabilities (LD) receive intensive 
evidence-based interventions in areas of need in addition to accommodations and modifications 
in order to be able to fully access the general curriculum (Vaughn, Zumeta, et al., 2014). 

One way previous research has conceptualized intensive intervention, is to provide more 
time in intervention. Increasing session length or the duration of the intervention may allow 
students with significant reading needs to accelerate their learning (Denton, Fletcher, Anthony & 
Francis, 2006; Torgesen et al., 2001; Vaughn, Linan-Thompson, & Hickman, 2003). Other 
studies have provided a more intensive reading intervention by reducing the size of the 
instructional group for intervention (Lou et al., 1996; Vaughn, Linan-Thompson, Kouzekanani, 
et al., 2003). In these studies, students receiving reading intervention in small groups 
outperformed students receiving reading intervention in larger groups. This research on 
intensifying interventions through time, dosage, or instructional group size became the basis for 
the quantitative ways to increase intensity of intervention that are commonly used in 
instructional models with multi-tiered intervention levels (National Center on Intensive 
Intervention, 2012). In 2008, the Institute of Education Sciences summarized recent reading 


intervention research and recommended that students with minimal response to Tier 2 
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interventions receive intensive daily reading intervention (Gersten et al., 2008). However, this 
recommendation had low evidence when evaluated using the What Works Clearinghouse 
standards. Only five studies were available for review at that time and there were no significant 
effects, indicating a clear need for updated review of the RTI literature particularly in the area of 
intensive reading interventions. The Center on Instruction released a practice guide on intensive 
interventions in reading and mathematics (Vaughn, Wanzek, Murray, & Roberts, 2012). This 
guide provided specific recommendations based on the available research for intensifying 
interventions through: (a) the use of strategies that promote cognitive processes, (b) delivering 
more explicit and systematic instruction in addition to increased opportunities for feedback, (c) 
providing additional instructional time, and (d) decreasing group size (Vaughn et al., 2012). This 
guide emphasized the way in which intensive interventions could be implemented in school 
settings; however, it did not review the efficacy studies of intensive reading interventions. 

Wanzek and Vaughn (2007) conducted a systematic review of intensive reading 
interventions (i.e., those with 100 sessions or more) in grades K-3 from 1995 to 2005 in order to 
inform the implementation of intensive interventions within an RTI framework. They identified 
13 studies that met criteria for inclusion in the review and determined that the intensive 
interventions resulted in generally positive effects on reading outcomes (Mean ES range = -0.05 
to 0.84). Additionally, the researchers synthesized intervention effects by duration, group size, 
grade level, and level of individualization, although these variables were not systematically 
manipulated in the original studies. The duration of the intervention did not appear to be related 
to the magnitude of the intervention effect. Studies of intensive interventions taught individually 
(1:1) reported greater effects than those taught in groups (2 to 8 students). There were not 


enough studies with small groups (2 to 4 students) for the authors to examine the effects of small 
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group implementation. Grade level was related to the magnitude of intervention effect, with 
larger effects reported in studies in grades K-1 than in Grades 2-3. Vaughn and Wanzek also 
sought to examine the effects of standardized and individualized interventions, but they were 
unable to locate any intensive reading interventions utilizing an individualized approach to 
instruction. There were no differences in effects on student reading outcomes for studies 
implementing highly standardized versus less standardized interventions. 

Additional research on intensive reading interventions has been conducted since Wanzek 
& Vaughn (2007) was published that can provide updated information on the effects of these 
intensity variables in reading intervention. 
Purpose and Rationale 

Many students at-risk for or identified with reading disabilities need intensive reading 
interventions (Fuchs, Fuchs, & Vaughn, 2014; Vaughn, Denton, & Fletcher, 2010; Vaughn & 
Wanzek, 2014). Practical recommendations for intensifying reading interventions continue to 
include examination of intervention duration, instructional group size, and individualization 
(National Center on Intensive Intervention, 2012). Since the Wanzek and Vaughn (2007) 
synthesis on intensive early reading interventions, additional research has been conducted 
examining the efficacy of reading interventions in Grades K-3. The current synthesis aims to 
update and extend the findings from the previous synthesis on intensive early reading 
interventions (Wanzek & Vaughn, 2007). We systematically searched the literature for studies 
conducted since the end of the last synthesis search (2005) using the same criteria as Wanzek and 
Vaughn in order to provide an updated corpus of studies on intensive reading interventions from 
1995 to 2015. We extended the previous synthesis by conducting a meta-analysis of the effects 


of intensive interventions and sought to examine duration of intervention, instructional group 
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size, grade level, and individualization of the intervention as moderators of these effects just as 
were examined originally in Vaughn and Wanzek. We also included publication year as a 
moderator to examine possible change over time in the effects of the research from the original 
synthesis to the present. Finally, we extended the previous synthesis by examining the effects of 
intervention by initial reading achievement given its general relationship to intervention 
responsiveness in the research. Specifically, we addressed three research questions: 

1) What are the effects of intensive early reading interventions for students with reading 

difficulties? 
2) What intervention or student characteristics are related to student outcomes? 
3) Does publication date predict the magnitude of effects of intensive early reading 
interventions for students with reading difficulties? 
Method 

A comprehensive search of the literature was conducted to identify articles that met 
inclusion criteria. First, we included all of the identified studies from the initial synthesis 
(Wanzek & Vaughn, 2007) with sufficient data for calculating effect sizes. Second, we 
conducted an updated search of PsycINFO and ERIC to identify studies published since the 
original synthesis (2006 and 2015). Abstracts were searched utilizing terms related to reading 
intervention (reading interven*, reading instruction, reading strategies, supplemental 
instruction, special educ*, phon*, fluency, vocab*, comp*) in combination with search terms 
intended to target our key population (reading difficult*, learning disab*, reading disab*, 
reading delays, reading disorder*, dyslex*). Additionally, a hand search was conducted of 
relevant reading and learning disabilities related journals (Exceptional Children, Journal of 


Learning Disabilities, Journal of Special Education, Learning Disabilities Research and 
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Practice, Reading Research Quarterly, Scientific Studies of Reading, Reading and Writing, 
Journal of Research in Reading, Journal of Literacy Research, Reading Teacher, Reading and 
Writing Quarterly, Learning Disability Quarterly, Remedial and Special Education, Dyslexia, 
Annals of Dyslexia) in 2015 to ensure all recently published articles were reviewed. 

Our initial updated search identified 8,039 abstracts for screening. We applied the same 
inclusion criteria used in Wanzek and Vaughn (2007): 

1. The study was reported in a peer-reviewed journal and printed in English. 

2. The participants included students with learning disabilities or students identified 
as at risk for reading difficulties (e.g., students with low ability, low phonemic 
awareness, language disorders); studies with additional participants were include 
if disaggregated data were provided for the students with learning disabilities or 
the students were identified as at risk. 

3. The participants were enrolled in Grades K-3 inclusive. 

4. Interventions targeted early literacy in an alphabetic language. 

5. Instruction was provided as part of school programming (i.e., not including home, 
clinic, or camp programs). 

6. Reading outcomes were measured in the study. 

In addition to the criteria applied from the Wanzek and Vaughn (2007) synthesis, we also 
selected only those studies with sufficient data to calculate effect sizes between study groups so 
that we could conduct the meta-analysis. 

Abstract review led to disqualification of 7,638 studies from the initial updated search 
(e.g., not intervention studies, age of participants outside of criterion). The full text of the 


remaining 401 articles was read. Of those articles, 13 studies met all inclusion criteria. These 13 
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studies from the updated search along with 12 studies from Wanzek and Vaughn (2007) with 
sufficient data to calculate effect sizes between study groups were included. Thus, a total of 25 
studies were included in the meta-analysis. 
Coding Procedures 
A detailed coding sheet was utilized to organize relevant information about each study 

(Vaughn, Elbaum, Wanzek, Scammacca, & Walker, 2014). The code sheet was based on 
elements specified in the What Works Clearinghouse Design and Implementation Assessment 
Device (Institute of Education Sciences, 2008) as well as coding information utilized in previous 
research. Data were collected on (a) participants (e.g. SES, risk type, gender); (b) research 
design and methodology; (c) description of experimental and control conditions; (d) clarity of 
causal inference; (e) measures; and (f) results. There were four trained coders. A gold standard 
method of coding was used with each coder demonstrating reliability to the first author before 
beginning coder. Intercoder agreement was calculated separately for each category on the 
codesheet (e.g., participants, measures, results) and reached 90% or above for all categories 
(range 92-100%). Intercoder agreement was calculated as the number of agreements divided by 
the number of agreements plus disagreements. In addition, all code sheets were independently 
coded by two coders. Any discrepancies were discussed until consensus was reached. 
Effect Size Calculation 

For all studies, Hedges’s g was calculated using the means and standard deviations for each 
group when such data were provided. Cohen’s d effect sizes and sample sizes for each group 
were used to calculate Hedges’s g when means and standard deviations were not reported. All 
effects were computed using the Comprehensive Meta Analysis (Version 3.3.070) software 


(Borenstein, Hedges, Higgins, & Rothstein, 2013). 
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Meta-analysis Procedures 

Standardized vs. unstandardized measures. Based on previous research that showed 
that effect sizes in reading intervention studies differ in magnitude based on whether the outcome 
measures are standardized or not (Scammacca et al., 2015; Swanson, Hoskyn, & Lee 1999), we 
considered conducting separate meta-analyses for each type of measure. However, in the corpus 
of studies that met the inclusion criteria, only 24 of the 328 effect sizes were calculated from 
scores on unstandardized measures. All of the 24 instances involved studies where both 
standardized and unstandardized measures were used to estimate treatment effects. Therefore, 
we decided to include both types of measures in the meta-analysis and conduct a sensitivity 
analysis to determine if results would differ if the unstandardized measures were not included. 

Treatment-comparison vs. multiple treatment studies. Additionally, we decided to 
meta-analyze effect sizes from both studies that used a treatment-comparison contrast (k = 19) 
and studies that contrasted two or more treatments (k = 6). This decision was based on several 
factors. First, the comparison condition in the treatment-comparison studies typically was 
business-as-usual classroom instruction, not a no-treatment control. Therefore, it seemed 
reasonable to view the comparison group’s instruction as an alternate treatment (albeit one not 
dictated by the researchers). Also, when mean effect sizes for the two study designs were 
compared, no significant difference was found (p = .76), leading us to determine that including 
both types of studies likely would not add systematic heterogeneity to the meta-analytic results. 
Finally, including both types of studies provided additional power for the meta-analysis. 

Analytic approach. Given that 328 effect sizes were calculated from 25 studies, it is 
apparent that most studies contained data for computing more than one effect size. The mean 


number of effect sizes per study was 13.12, with a range of 3 to 42. Multiple effect sizes resulted 
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from multiple outcome measures being used to estimate the treatment effect and from studies 
that involved more than one pair of group contrasts. When multiple effect sizes are computed 
from a single study, the meta-analytic data contain dependencies that must be accounted for in 
the analysis. 

To accommodate the dependency in the data, robust variance estimation (RVE; Hedges, 
Tipton, & Johnson, 2010) was implemented to conduct the meta-analysis, using the robumeta 
package for R (Fisher & Tipton, 2015). RVE adjusts the standard errors of the effect size 
estimates to account for the correlation between effect sizes within studies. RVE requires that 
the mean correlation between all pairs of effect sizes within a study (p) be specified in order to 
estimate the study weights and calculate the between-study variance. As shown by Hedges et al. 
(2010), the value selected for p generally does not yield meaningful differences in the results; 
they recommended conducting a sensitivity analysis in which varying p values are used in the 
meta-analytic models. Using .2, .5, and .8, we found no differences in the results of the meta- 
analysis. The results reported below used a p of .8. 

An additional consideration when using RVE is the increase in Type I error rates that has 
been found when the number of studies included in the meta-analysis is less than 40 (Tipton, 
2015). Because this meta-analysis included 25 studies, the small-sample correction developed 
by Tipton (2015) was implemented when the model was run in robumeta. Tipton (2015) showed 
that this correction prevents inflation of Type I error rates. An intercept-only meta-regression 
model was run to estimate the overall mean effect size and standard error and to calculate indices 
of heterogeneity. 

Moderator analyses. We planned to conduct moderator analyses using the following 


variables in meta-regression models implemented with RVE: hours of treatment (less than or 
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equal to 63 hours vs. more than 63 hours); instructional group size (small group vs. one-on-one), 
grade level, individualization of the intervention, year of publication, and ability level of the 
sample based on pre-test standardized reading test scores. However, as Wanzek and Vaughn 
(2007) also encountered, the majority of the work on intensive interventions occurred in Grades 
K-1 and examined only standardized interventions, preventing examination of grade level and 
individualization of intervention as moderators of effects due to lack of variation across studies. 
Hours of treatment could not be operationalized as a continuous variable because this 
information was reported as a range or a Mean in many studies. As a result, we chose the median 
of the dataset as a dividing point to categorize the length of treatment. One study had a wide 
range of total hours of instruction that crossed the median and was therefore not included in the 
analysis for total hours. Standardized pre-test scores were reported in 12 of the 25 studies. For 
these studies, az score was calculated using the mean and standard deviation of the test’s 
normative sample to estimate the ability level of the study’s sample. This z score was used as a 
covariate in a meta-regression model to determine if pre-test ability level predicted the post-test 
effect size for the difference between groups. We recognized that power for the moderator 
analyses was low and considered these analyses to be exploratory in nature. 
Results 

Tables 1 and 2 provide the key features and descriptions of the independent and 
dependent variables for each study. The estimate of the mean effect size across the 25 studies 
included in the analysis using data from both standardized and unstandardized measures was 
0.39, and differed significantly from zero (p < .001, 95% CI = 0.30, 0.48). The J? estimate of the 
percentage of between-study heterogeneity not due to chance variation in effects was 0.00%, 


with a 7’ estimate of the true variance in the population of effects of 0.00. The estimate of the 
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mean effect, the 95% confidence interval, and the heterogeneity statistics did not change when 
unstandardized measures were dropped from the dataset. These results indicate that there was no 
statistically significant or meaningful heterogeneity in the study-wise effect sizes, meaning that 
moderator analyses were not warranted. However, for exploratory purposes we calculated 
intercept-only models to determine the mean effect size at each level of the categorical 
moderator variables we had selected. See Table 3 for results. Additionally, a meta-regression 
model using pre-test reading ability as a covariate predicting effect sizes across the 12 studies 
that included pre-test data from standardized measures indicated that pre-test z scores did not 
predict effect size (6 = -.03, SE = .11, df = 3, p = .78). However, RVE estimates from models 
with fewer than four degrees of freedom are unreliable (Tipton, 2015), making the results for this 
analysis inconclusive. 

Lastly, we explored year of publication as a covariate predicting effect size across all 25 
studies. Results from the meta-regression model indicated that year of publication did not 
predict effect size (6 = -.01, SE = .01, df = 12, p = .20). 

Publication Bias 

Because unpublished studies were not included in this meta-analysis, publication bias 
threatens the validity of our results and its impact must be explored. Using the trim-and-fill 
method (Duval & Tweedie, 2000), we evaluated the potential impact of publication bias based on 
a random effects model. The trim-and-fill method removes effect sizes that cause asymmetry in 
a funnel plot of the effect sizes included in the meta-analysis, calculates a mean effect, and then 
imputes the effect sizes needed to make the plot symmetrical. The results indicate how many 
studies may be missing from the meta-analysis due to publication bias and produces an adjusted 


effect size based on including the missing studies. Results of the trim-and-fill analysis for the 
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present meta-analysis found that publication bias might have inflated the mean effect size 
estimate. Seven studies with effect sizes that were smaller than the mean effect of 0.39 likely 
were missing from the dataset. When effect sizes from these missing studies were included, the 
adjusted mean effect size was 0.28 (95% CI = 0.20, 0.37). 
Discussion 

The purpose of the current study was to conduct a meta-analysis of the research on 
intensive early reading interventions (i.e., provided for 100 or more sessions) for students in 
Grades K through 3. Specifically, we sought to update and extend Wanzek and Vaughn’s (2007) 
synthesis of intensive interventions too allow examination of a larger corpus of studies from 
1995-2005. We examined intervention and student characteristics that may be related to student 
outcomes in order to provide educators and researchers with the current state of evidence on the 
implementation of intensive interventions. We also examined whether there were systematic 


changes in effects of these studies since the previous synthesis. 


The Effects of Intensive Early Reading Interventions 

The weighted mean effect size estimate (ES = 0.39), with a mean effect size adjusted for 
publication bias (ES = 0.28), both significantly different from zero, suggests intensive, early 
reading interventions result in positive outcomes for early struggling readers in kindergarten 
through third grades. Students in these grades receiving intensive interventions like the ones in 
this meta-analysis may make improvements in reading by as much as four tenths of a standard 
deviation. The effects of these intensive early reading interventions are similar to those reported 
for less intensive early reading interventions of less than 100 sessions (Wanzek et al., 2016). 


These findings suggest a variety of early reading interventions can improve student reading 
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outcomes. The study of less intensive interventions did yield significant variance in effects 
among the studies but it was not explained by intervention type, instructional group size, grade 
level, implementer, or total hours of intervention. The corpus of intensive early reading 
intervention did not yield significant variance in effects, further suggesting these interventions 
can assist students in improving their reading outcomes. We discuss the common elements of 
these studies that may have led to the lack of significant variance in the effects. 

Over 90% of the effect sizes contributing to the weighted mean effect size estimate in the 
current study were calculated from results on standardized measures; this enhances our 
confidence that the findings were associated with meaningful gains in reading, as standardized 
measures are more representative of generalized reading skill than proximal, specialized 
measures that are often aligned to the intervention (Lipsey et al., 2012). These effect sizes are 
also larger than typical effects for elementary academic interventions when outcomes are 
measured on standardized measures, which range from 0.08 for broad subject matter measures to 
0.25 for more narrow standardized measures on specific components of reading (Lipsey et al., 
2012). The previous intensive reading intervention synthesis did not report a weighted mean 
estimate, but reported study-wise mean effects ranging from -0.05 to 0.84 (Wanzek & Vaughn, 
2007). 

We hypothesized that publication year would significantly predict effect size, suggesting 
the effects of interventions conducted more recently may yield less impact, a finding reported in 
previous research for reading interventions with older students (Scammacca et al., 2015). 
However, our moderator analysis indicated publication year did not significantly predict effect 
size, suggesting there is no difference in the impact of intensive interventions across the twenty- 


year time span. One reason for this might be that nearly all of the measures included in this 
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corpus were standardized. Scammacca et al. (2015) also reported no moderation of publication 
year when only standardized measures were included in the moderation model. It appears that 
across time, intensive interventions like those included in this meta-analysis result in positive, 
generalized reading improvement for early struggling readers. 

Characteristics of Intensive Interventions Related to Student Outcomes 

We attempted to identify features of the interventions (e.g., group size, hours of 
intervention) included in this meta-analysis that were associated with increased effectiveness; 
however, we did not find significant heterogeneity in effect sizes across studies. The 
predominant use of standardized measures may be one reason for the lack of heterogeneity 
across study effects. We did calculate weighted mean effects for the intervention characteristics. 
We recognize that the results of these analyses may differ in a future dataset that has a larger 
sample of studies with more heterogeneity of variance in the corpus of effect sizes. 

The success of multi-tiered systems of support is predicated on the provision of 
increasingly intensive interventions in response to students’ needs. One suggestion for 
intensifying interventions when students respond limitedly to a less-intensive intervention is by 
reducing group size (Vaughn et al., 2012). In the previous synthesis, Vaughn and Wanzek 
(2007) reported effects for one-on-one instruction (M ES range 0.17 to 0.84) were generally 
larger than effects for group instruction (M ES range -0.05 to 0.39), but no statistical evaluation 
of reducing group size could be conducted. Consistent with those findings, this meta-analysis 
provides some limited evidence that students may respond more favorably to interventions 
provided in one-on-one settings (ES = 0.59) versus groups (i.e., 2-8 students; ES = 0.33). One 
reason for this finding may be that one-on-one instruction allows for heightened response (i.e., 


more frequent feedback, data-based instructional changes) to a student’s instructional needs that 
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may be more difficult to achieve with more than one student in a group. Also, most of the 
studies utilizing one-on-one instruction were with first graders and, thus, it is not possible to 
know whether this age group is more responsive to one-on-one instruction or whether the effects 
are simply due higher growth of first graders in these interventions. Unfortunately, the research 
base on intensive interventions does not yet have enough studies utilizing small groups (e.g., 2-4 
students) to examine these smaller groups versus larger instructional groups, a problem noted in 
the earlier synthesis as well. 

Another way to increase intervention intensity is by increasing the duration of the 
intervention (Vaughn et al., 2012). While all of the included studies had 100 or more sessions, 
the length of time for each session varied. In the previous review, the authors were unable to 
examine the effect of intervention hours due to limited information provided in the included 
articles. Results from the studies reported in the current meta-analysis suggest more studies 
reported information needed to determine the hours of intervention. The finding that 
interventions provided for more than 63 hrs as well as 25-63 hrs produced significant, moderate 
effects suggests that providing an extended number of sessions may be more important than the 
number of hours of intervention. 

Overall, the results of this meta-analysis indicate that intensive interventions result in 
positive gains in reading performance for struggling readers in Grades K through 3. There is 
limited variability in the effects, indicating that intervention commonalities may be driving the 
positive effects more than differences such as group size or duration. Intervention commonalities 
across studies included the following: (a) a high level of standardization in which all students 
received the same instruction using a set of well-prescribed lessons and materials for modeling 


and guiding students in learning new reading practices; (b) instructional content addressing 
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phonological awareness (e.g., syllable segmentation, phoneme identification and manipulation), 
phonics and word recognition (e.g., letter-name and letter-sound correspondence, blending and 
segmenting the sounds in words, reading decodable words and high frequency words), and 
fluency (e.g., initial reading, rereading, and shared reading of decodable texts); and (c) school 
staff or community members implemented the interventions. Thus, generally standardized, 
explicit instruction including reading foundational skills provided for more than 100 sessions has 
a positive effect for students with reading difficulties in Grades K-3. These interventions can be 
feasibly implemented by school personnel. 
Implications, Limitations, and Future Research 

The effective implementation of multi-tiered systems of support provides increasingly 
intensive intervention in response to student needs. This meta-analysis provides further evidence 
for school-based teams to consider as they make decisions about interventions for students with 
significant reading problems in grades kindergarten through third grade. This review provides a 
strong rationale for providing early intervention with the most effective instructional practices. 

This meta-analysis synthesizes causal evidence for the effects of intensive reading 
intervention on the reading outcomes of students in Grades K through 3 with reading difficulties. 
However, future research is needed to enhance our understanding of intensive interventions. 
One-on-one instruction may improve students’ response to intervention in primary grades, but 
may not necessarily be more effective than small group instruction, which was not implemented 
in enough studies for examination. Additional research examining small group instruction is 
needed to better ascertain the effect of intensity of intervention based on group size. Researchers 
might consider examining the long-term implications of one-on-one and small group instruction. 


Do the long-term benefits of one-on-one and small group instruction differ? Given the cost and 
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resource demands associated with one-on-one instruction, it may be more economically and 
practically feasible to implement small group intervention, particularly if the long-term benefits 
are similar. 

Additionally, researchers might consider investigating the effects of intensive 
interventions specifically for second and third grade students. The previous synthesis reported 
13 studies investigating interventions that began in kindergarten and first grades and five studies 
investigating interventions that began in second and third grades. All of the studies conducted 
since 2005 (k = 13) addressed struggling readers in Grades K and 1. As such, there is limited 
evidence available for the impact of intensive interventions on second and third grade students. 
Given that students in Grades 2 and 3 may present more challenging reading difficulties, it is 
important to explore the most effective intervention for these learners. Furthermore, the majority 
of the more recent studies provided instruction in phonological awareness, phonics and word 
recognition, and fluency. Only five studies included comprehension instruction (Denton et al., 
2010; Harn, Linan-Thompson, & Roberts, 2008; Little et al., 2012; Simmons et al., 2007; 
Vaughn et al., 2006). Struggling readers in Grades 2 and 3 may benefit from multicomponent 
interventions that include explicit comprehension and vocabulary components in addition to 
word recognition and fluency. 

The current meta-analysis also highlighted the continued lack of research on 
individualized interventions. Researchers might examine the impact of non-standardized 
approaches compared to standardized interventions. Standardized interventions are defined as a 
set of well-prescribed lessons and materials. The level of standardization may be high, such that 
all students in the group receive the same instruction, or low, meaning that interventionists still 


provide the same lessons to all students, but they may make minor adjustments based on student 


EFFECTS OF INTENSIVE EARLY READING INTERVENTIONS 21 


levels (Wanzek & Vaughn, 2007). In contrast to standardized interventions, an individualized, 
or problem-solving approach, provides instruction based on student needs. This involves 
identifying the problem and potential causes, creating a plan to address the problem, 
implementing the plan, and then evaluating the plan. This instructional process may also be 
referred to as data-based individualization. As with the previous corpus (i.e., 1995-2005), we 
identified no kindergarten through third grade studies conducted since 2005 that examined an 
individualized approach to intensive intervention. It is important to identify the relative impact 
of standardized versus problem-solving approaches. 

Due to a limited number of studies (k = 12) reporting standardized pre-test scores in 
reading, we were unable to fully examine pre-test reading ability as a predictor of intervention 
effectiveness. Our findings, though not considered reliable due to the small number of studies, 
indicated no differences in effects of the intensive interventions based on students’ incoming 
levels of achievement. We recommend that researchers report scores on standardized pre-test 
measures so that the role of initial reading level in treatment effects can be more fully explored 
across studies. Understanding intervention effects related to specific reading levels at pre-test 
may allow educators to better match intensive intervention to students with identified reading 


levels prior to receiving Tier 3 type interventions. 
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Table 1 


Features of Intervention 


Study N Grade Frequency Duration Group Implementer 
(in weeks) ae 
Coyne, Little et al. (2013) 162 AR K Daily, 30 min 25 3to5 General Education Teacher 
Coyne, Simmons et al. (2013) 103 AR K Daily; 30 min 25 3to5 MI 
Denton et al. (2010) 422 AR/LD/ 1 Daily; 40 min 25 3to4 General Education Teacher 
Disab. 
Fein et al. (2015) 267 AR 1 Daily; 30 min 26 3to5 MI 
Foorman et al. (1997) 114LD 2to3 Daily; 60 min 36 8 Special Education Teacher 
Gunn et al. (2000) 256 SR Kto3 Daily; 25-30min 60to69 1to3 Researcher 
Hagan-Burke et al. (2011) 206 AR K Daily; 30 min 21 3to5 MI 
Harn et al. (2008) 54 AR 1 Daily; 30 or 60 24to25 3tos MI-R 
min 
Hatcher et al. (2006) 77 SR K Daily; 20 min 10 or 20 1-3 Paraprofessional 
Jenkins (2004) 99 SR 1 4x week; 30 min 25 i Paraprofessional 
Little et al. (2012) 90 LD/*Disab. K Daily; 30 min 16 to 27 5 Reading Interventionist 
Mathes et al. (2005) 389 AR 1 Daily; 40 min 35 3 Researcher 
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Study N Grade Frequency Duration Implementer 

(in weeks) 
Miller (2003) 65 SR 1 4x week; 40 min 36 Paraprofessional 
Morris et al. (2000) 86 SR 1 Daily; 30 min 32 MI 
Santa & Hoien (1999) 49 SR 1 Daily; 30 min 35 MI 
Schneider et al. (2000) 253 AR K Daily; 10-20 min 10to20 5to8 General Education Teacher 
Simmons et al. (2007) 112 AR K Daily; 30 min 21.6 MI 
Simmons et al. (2011) 206 AR K Daily; 30 min 20 MI 
Torgesen et al. (1999) 180 SR Ktol 4x week; 20 min 130 MI 
Torgesen et al. (2010) 112 AR 1 4x week; 50min 24to 26 Researcher 
Vadasy et al. (2005) 57 Disab. 1 4x week; 30 min a2 Paraprofessional 
Vadasy et al. (1997) 35 AR 1 4x week; 30 min 27 Community Volunteers 
Vadasy et al. (2002) 65 AR 1to2 4xweek; 30min 35or70 Paraprofessional 
Vaughn et al. (2006) 48 AR 1 Daily; 50 min 18 to 28 Researcher 
Wang & Algozzine (2008) 139 AR 1 NR; 10-15 min NR Paraprofessional 


Note: SR = struggling readers; AR = at risk; LD = learning disabilities; Disab. = disabilities, type not specified; NR = not reported; MI 
= multiple school level implementers; *Sample included several disability types 
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Table 2 


Description of Interventions 


Study 


Measures 


Coyne, Little et al. (2013) 
T (Early Reading Intervention): Instruction in PA and writing and 
spelling with previously taught phonics skills. 
C (Harcourt Trophies): Typical school intervention using phonics, 
guided reading, and PA 


Coyne, Simmons et al. (2013) 
T1 (Early Reading Intervention): Instruction in PA, writing, and 
spelling with previously taught phonics skills 
T2 (Early Reading Intervention with adjustments): Implemented ERI 
but used in-program assessments to measure mastery then 
systematically target specific skills, regrouped students, and made 
instructional changes 


Denton et al. (2010) 
T (Responsive Reading Instruction): Instruction in PA, phonics, WR, 
fluency, and comprehension 
C (Typical School Practice): Typical reading instruction 


Fein et al. (2015) 
T (Tier 2 intervention): Instruction in PA, phonics/WR, and fluency 
C (Business as usual): Standard district program 


Foorman et al. (1997) 


T1 (Synthetic Phonics): Based on Orton-Gillingham approach. Multi- 


sensory instruction in PA and phonics. 

T2 (Analytic Phonics): Scripted instruction in phonics and fluency. 
T3 (Edmark Reading Program): Instruction in WR, fluency, and 
spelling. 


WRMT/NU (letter name and sound checklists, WA, 


WID); PPVT; DIBELS (PSF and NWF); CTOPP (Sound 


Matching, Blending Words) 


WRMT/NU (Letter ID, WA, Letter Sound Checklist, 
PC); PPVT-IH; CTOPP (Sound Matching, Blending 
Words); DIBELS (PSF, NWF, ORF); Test of Written 
Spelling 


WJ-III (LID, WA, PC, Spelling); CTOPP (BW, 
Segmenting Words): TOWRE (SWE, PDE); 
Comprehensive Monitoring of Early Reading Skills 
(ORF) 


DIBELS (NWF, ORF); WRMT (Basic Skills Cluster, 
Total Reading); SAT 10 


Orthographic processing, phonological analysis, and 
word reading 
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Study 


Gunn et al. (2000) 
T (Reading Mastery or Corrective Reading): Explicit instruction in 
PA, letter-sound correspondence, and blending. Students placed 
based on reading level. 
C (Control): No supplemental instruction. 


Hagan-Burke et al. (2011) 
T (Explicit, Systematic, Code-Based Instruction): Instruction in PA, 
writing and spelling. 
C (School Designated Instruction): Typical school instruction 


Harn et al. (2008) 
T1 (Less intensive intervention): Instruction in phonics and WR, 
fluency, passage reading, and comprehension. 30-minute sessions. 
T2 (More intensive intervention): Instruction in phonics and WR, 
fluency, passage reading, and comprehension. 60-minute sessions. 


Hatcher et al. (2006) 
T1 (Modified Sound-Linkage Reading Intervention): Group 
instruction in letter ID, PA and writing. Individual instruction in 
phonics and fluency for 20 weeks. 
T2 (Modified Sound-Linkage Reading Intervention): Group 
instruction in letter identification, PA and writing. Individual 
instruction in phonics and fluency for 10 weeks. 


Jenkins (2004) 
T1 (Sound Partners- More Decodable Texts): Instruction in PA, 
phonics, WR and spelling using more decodable texts. 
T2 (Sound Partners-Less Decodable Texts): Instruction in PA, 
phonics, WR and spelling using less decodable texts. 
C (Typical Practice): Typical classroom instruction 


Measures 


WJ-R ACH (WA, LID, PC, Reading Vocabulary); 
DIBELS (ORF) 


CTOPP (Sound Matching, Blending Words); WRMT-R 
(Letter Name Checklist, Letter Sound Checklist, WA, 
WID) 


WRMT-R (WA, WID, PC); TOWRE (SWE, PDE) 
DIBELS (NWE, ORF) 


British Picture Vocabulary Test; Phonological Abilities 
Test; Sound Linkage Test of Phonological Awareness; 
Letter Identification; Early Word Recognition Test; 
British Ability Scales, Word Reading Test 


PPVT-R; CTOPP (RLN, NWR); Yopp-Singer 
Segmentation test; Modified Rosner’s deletion test; 
WRMIT-R (WA); Diagnostic Test of Basic Decoding 
Skills; WRAT-R (Reading, Spelling); WRMT-R (WID); 
TOWRE; 

*Phonetically controlled passage (fluency, accuracy); 
*Non-phonetically controlled passage (fluency, 
accuracy) 
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Study 
Little et al. (2012) 


DIBELS (PSF, NWF, ORF); WRMT-R (Letter Name 


T (Early Reading Intervention): Instruction in PA and writing and 
spelling with previously taught phonics skills. Student progress 
assessed every 4 weeks to determine adequate progress and make 
adjustments. 

C (School Designated Instruction): Typical school instruction 


Mathes et al. (2005) 
T1 (Proactive Reading): Explicit instruction in phonics/WR (in 
isolation), fluency, and comprehension 
T2 (Reactive Reading): Instruction in phonics/WR (not in isolation), 
fluency, and comprehension 
C (Enhanced Instruction): Teachers were given progress monitoring 
data and trained to use data to inform their typical instruction 


Miller (2003) 
T1 (Partner in Reading): Instruction in phonics/WR and 
comprehension 
T2 (Reading Recovery): Instruction in phonics/WR and 
comprehension 


C (School Designated Instruction): Typical classroom instruction 


Morris et al. (2000) 
T (Early Steps):Instruction in phonics/WR, fluency, and 
Comprehension 
C (School Designated Instruction): Typical classroom instruction 


Santa & Hoien (1999) 
T (Early Steps): Instruction in phonics/WR, fluency, and 
comprehension 
C (School Designated Instruction): Typical classroom instruction 


Schneider et al. (2000) 
T1 (PA Training): Instruction in PA and phonics/WR 


Measures 


and Sounds checklist, WID, WA, Letter ID); PPVT-III; 
CTOPP (Sound Matching, Blending Words); Test of 
Written Spelling-4 


W4J-III (WA, WID, PC, Spelling, Reading Fluency); 
CRAB-R (Fluency, Comprehension) 


Metropolitan Achievement Test (Word Recognition, 
Comprehension, Vocabulary, Language) 


WRMT (WA, PC); *Word Recognition, *Spelling, 
*Passage Reading 


*Spelling, *Word Recognition, Early Steps Passage 
Reading 


Lundberg and Wimmer Metalinguistic Battery (Initial 
Sound Analysis, Identification of End Sounds, New 
phoneme Analysis, Word Length Analysis, Supply of 
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Study 


T2 (Phonological Awareness and Letter Sound Training): Instruction 
in PA and phonics/WR. 

T3 (Letter Sound Training): Instruction in letter sound 
correspondence. 


Simmons et al. (2007) 


T1 (Highly Specified Design): Systematic instruction in PA, phonics, 
writing, and spelling. 

T2 (Highly Specified Design + Intervention): Systematic instruction 
in PA, phonics, writing, and spelling with instruction in 
comprehension. 

T3 (Moderately Specified Design): Instruction in PA, phonics, 
writing, and spelling. 


Simmons et al. (2011) 


T (Early Reading Intervention): Instruction in PA and writing and 
spelling with previously taught phonics skills. 
C (School Designated Instruction): Typical Kindergarten instruction 


Torgesen et al. (1999) 


T1 (Phonological Awareness Plus Synthetic Phonics): Explicit 
instruction in phonics and PA 

T2 (Embedded Phonics): Less intensive phonics instruction 
T3 (Regular Classroom Support): Individual tutoring based on 
regular classroom instruction 

C (No treatment): No additional tutoring 


Torgesen et al. (2010) 


T1 (Read, Write, and Type): Direct instruction in PA and 

phonics/WR with computer practice. 

T2 (Lindamood Phoneme Sequencing Program): Direct instruction in 
PA and phonics/WR with computer practice. 

C (No treatment): No additional instruction 


Measures 


Initial Consonant, Vowel Substitution); Wurzburger 
Leise Leseprobe test; Weingartener Basic Vocabulary 
Spelling Test: Diagnostic Spelling Test for Second 
Graders 


PPVT; DIBELS (Letter Name Fluency, PSF, NWF); 
Tangel & Blachman Spelling Test; Yopp Singer Test 
(Phoneme Segmentation); WRMT-R (WA, WID); Letter 
Dictation Fluency 


WRMT-R/NU (WID, Letter Name Checklist, Letter 
Sound Checklist, PC); DIBELS (NWF, PSF); Test of 
Written Spelling-4; TOWRE (PDE, SWE); CTOPP 
(Sound Matching, Blending Words) 


WRMT (WA, WID, PC); WRAT (Spelling); GORT-HI 
(Comprehension); Developmental Spelling WJPB 
Calculation 


WRMT (WID, WA, PC); TOWRE (SWE, PDE); GORT- 
3 (Reading Accuracy, Text Reading Fluency); CTOPP 
(BW, Segmenting Words, Rapid Naming Digits and 
Letters); Developmental Spelling Analysis; Stanford 
Binet Intelligence Scale 4" ed. (Vocabulary); WRAT 
(Spelling) 


EFFECTS OF INTENSIVE EARLY READING INTERVENTIONS 38 


Study 
Vadasy et al. (2005) 
T1 (Reading Practice): Intervention time split between instruction in 
phonics/WR and fluency. 
T2 (Word Study): Entire intervention time spent on phonics/WR 
C (School Designated Instruction) Typical classroom instruction 


Vadasy et al. (1997) 
T CUntervention): Instruction in PA, phonics/WR, and fluency 
C (School Designated Instruction): Typical classroom instruction 


Vadasy et al. (2002) 
T1 (Sound Partners & Thinking Partners): Instruction in PA, 
phonics/WR, fluency and comprehension 
T2 (Thinking Partners): Instruction in comprehension 
C (School Designated Instruction): Typical classroom instruction 


Vaughn et al. (2006) 
T (Reading and Language Development Intervention): Instruction in 
PA, phonics/WR, fluency, oral language/vocabulary, and 
comprehension 
C (School Designated Instruction): Typical classroom instruction 


Wang & Algozzine (2008) 
T (Supplemental Targeted Intervention): Direct instruction in PA, 
phonics/WR, and fluency 
C (School Designated Instruction): Typical classroom instruction 


Measures 

WRMT-R/NU (WA, WID, PC); WRAT-R (Reading, 
Spelling); TOWRE (PDE, SWE); Passage Reading 
Fluency, Passage Reading Accuracy 


Analytical Reading Inventory; Dolch Word Recognition 
Test; WRAT-R (Reading, Spelling) WJ-R (WA, WID); 
Yopp-Singer Segmentation Task; Writing Sample; 
Bryant Pseudo-Word Test 

WRAT-R (Reading, Spelling) WRMT (WID, WA) 
TOWRE (PDE, SWE); *Read Naturally Comprehension; 
Informal Reading Inventory 


WLPB-R (Picture Vocabulary, Verbal Analogies, 
Memory for Sentences, Oral Language Composite, WA, 
Dictation, PC); DIBELS (Letter name Identification, 
ORF, Letter Sound Identification); CTOPP (RLN, NWR) 


WRMT-R (WID, WA, PC); DIBELS (PSF, NWF) 


* Indicates unstandardized measure 


Note: PA = phonological awareness; WR = word recognition; WMRT/NU = Woodcock Reading Mastery Tests—Revised/Normative 
Update; WRMT-R = Woodcock Reading Mastery Tests -Revised; WRMT = Woodcock Reading Mastery Tests; PPVT = Peabody 
Picture Vocabulary Test; DIBELS = Dynamic Indicators of Basic Early Literacy Skills; CTOPP = Comprehensive Test of 
Phonological Processing; SAT 10 = Stanford Achievement Test—10th Edition; WJ-R = Woodcock Johnson-Revised; WJ-III = 
Woodcock Johnson II; WJ-R ACH = Woodcock Johnson Test of Achievement; TOWRE= Test of Word Reading Efficiency; CRAB- 
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R = Comprehensive Assessment of Reading Battery Revised for First-Grade; WJPB = Woodcock-Johnson Psycho-Educational 
Battery; WLPB-R = Woodcock Language Proficiency Battery-Revised; GORT-III = Gray Oral Reading Test III]; PSF=Phoneme 
Segmentation Fluency; NWF = Nonsense Word Fluency; LID= Letter Word Identification; WA = Word Attack; PC = Passage 
Comprehension; SWE = Sight Word Efficiency; PDE = Phonemic Decoding Efficiency; ORF = Oral Reading Fluency; RLN = Rapid 
Letter Naming, NWR = Non-word Repetition task; WID = Word Identification 
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Table 3 


Effect Size by Categorical Variables 


Coeff SE 95%CI D df P - n k 
Hours of Treatment 
63 or less 0.33 0.07 0.17 O50 .001 110 25.74 07) «+167 12 
More than 63 0.45 0.05 034 056 <.001 9 0 O 145 12 
Group Size 
Small group 0.33 0.05 0.23 043 <.001 15 2.92 01 220 17 
One-on-one 0.59 0.06 045 0.73 <.001 10 0 0 108 8 


Note. Coeff = Coefficient; n = number of effect sizes; k = number of studies 
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